Computing Semantic Relatedness in German with Revised Information Content Metrics

نویسندگان

  • Iryna Gurevych
  • Hendrik Niederlich
چکیده

The paper presents an application of information content based metrics to compute semantic relatedness of word senses in German. The main contributions are: an annotation study based on a revised definition of semantic relatedness beyond synonymy, an extension of Resnik’s (1995) procedure for computing information content of concepts for strongly inflected languages, an application of information content based metrics to compute semantic relatedness of German word senses defined in GermaNet (Kunze, 2004) and a new interpretation and normalization function for Jiang & Conrath’s (1997) distance metric. Semantic relatedness metrics consistently outperform two baselines: a Lesk based algorithm, and one using Google word co-occurrence statistics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computing Semantic Relatedness of GermaNet Concepts

We present a system designed to compute the semantic relatedness between a pair of GermaNet concepts (word senses). Five different metrics have been implemented. Three of them are information content based and incorporate the Two metrics constitute the application of a Lesk algorithm (Lesk 1986) to artificial conceptual glosses generated from GermaNet. We show that four metrics correlate very w...

متن کامل

Accessing GermaNet Data and Computing Semantic Relatedness

We present an API developed to access GermaNet, a lexical semantic database for German represented in XML. The API provides a set of software functions for parsing and retrieving information from GermaNet. Then, we present a case study which builds upon the GermaNet API and implements an application for computing semantic relatedness according to five different metrics. The package can, again, ...

متن کامل

Comparing Wikipedia and German Wordnet by Evaluating Semantic Relatedness on Multiple Datasets

We evaluate semantic relatedness measures on different German datasets showing that their performance depends on: (i) the definition of relatedness that was underlying the construction of the evaluation dataset, and (ii) the knowledge source used for computing semantic relatedness. We analyze how the underlying knowledge source influences the performance of a measure. Finally, we investigate th...

متن کامل

Using the Structure of a Conceptual Network in Computing Semantic Relatedness

We present a new method for computing semantic relatedness of concepts. The method relies solely on the structure of a conceptual network and eliminates the need for performing additional corpus analysis. The network structure is employed to generate artificial conceptual glosses. They replace textual definitions proper written by humans and are processed by a dictionary based metric of semanti...

متن کامل

Evaluating Semantic Metrics on Tasks of Concept Similarity

This study presents an evaluation of WordNet-based semantic similarity and relatedness measures in tasks focused on concept similarity. Assuming similarity as distinct from relatedness, the goal is to fill a gap within the current body of work in the evaluation of similarity and relatedness measures. Past studies have either focused entirely on relatedness or only evaluated judgments over words...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005